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Abstract. We give an easy counter-example to Problem 7.20 from C. Vil- 
lani's book on mass transport: in general, the quadratic Wasserstein distance 
between n-fold normalized convolutions of two given measures fails to decrease 
monotonically. 



We use the terminology and notation from [5]. For Borel measures /j,, v on E d 
we define the quadratic Wasserstein distance 

T(n,v) := inf E[||X- Y|| 2 1 
(XX) 

where || • || is the Euclidean distance on M. d and the pairs (X,Y) run through all 
random vectors defined on some common probabilistic space (f2, !F, P), such that X 
has distribution /i and Y has distribution v. By a slight abuse of notation we define 
T(U, V) := T(fi, v) for two random vectors U, V, such that U has distribution \x 
and V has distribution v. The following theorem (see [5j Proposition 7.17]) is due 
to Tanaka [4]. 

Theorem 1. For a,t £ I and square integrable random vectors X , Y , X' . Y' 
such that X is independent of Y , and X' is independent of Y' , and M[X] = E[X'] 
or E[Y] = E[Y'], we have 

T{aX + bY, aX' + bY 1 ) < a 2 T(XX') + b 2 T{Y, Y 1 ). 

For a sequence of i.i.d. random vectors (-X"j)jgN we define the normalized partial 
sums 

^ m 

S m := -=y^Xi, meN. 
V rri r-f 

If [i denotes the law of X%, we write for the law of S m . Clearly fj,^ m ' equals, 
up to the scaling factor ^/m, the m-fold convolution fj, * fi * • • • * fj, of fx. 

We shall always deal with measures v with vanishing barycenter. Given two 
measures /x and v on R d with finite second moments, we let (Xj)j £ ^ and (X-)i 6 N 
be i.i.d. sequences with law \x and v, respectively, and denote by S m and S' m the 
corresponding normalized partial sums. From Theorem [1] we obtain 

T(/i (2m V (2m) ) <T((J, (m) ,v (m) ), meN, 
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from which one may quickly deduce a proof of the Central Limit Theorem (compare 
[3 Ch. 7.4] and the references given there). 

However, we can not deduce from Theorem [1] that the inequality 

(!) T(^ m+1 \i/ m+1 )) <T(^ m \v^) 

holds true for all to € N. Specializing to the case to — 2, an estimate, which we 
can obtain from Tanaka's Theorem, is 

T(m ( V (3) ) < \[2T{^M 2) ) +TM] < T{p,u). 

This contains some valid information, but does not imply |T]). It was posed as 
Problem 7.20 of [5], whether inequality ([I]) holds true for all probability measures 
v on R d and all m e N. 

The subsequent easy example shows that the answer is no, even for d = 1 and 
symmetric measures. We can choose /i = p n and v = v n for sufficiently large n > 2, 
as the proposition (see also Remark [1]) shows. 

Proposition 1. Denote by p, n the distribution of Y)i—i 1 Z ir & n d by v n the distri- 
bution of Y%=i Zi with (Zi)ieN i.i.d. and ¥{Z 1 = 1) = P(^i = -1) = \. Then 

(2) lim y/nT(p n * p n , v n * u n ) = 



/2tt 

while T(/i„ * p n * /i„, v n * v n * v n ) > 1 /or a/Z n G N. 

Remark 1. // one only wants to find a counter-example to Problem 7.20 of [5], 
one does not really need the full strength of Proposition [71 i. e. the estimate that 
T(fi n * v n * v n ) — 0(1/ \fn). In fact, it is sufficient to consider the case n = 2 
in order to contradict the monotonicity of inequality {T]) . Indeed, a direct calculation 
reveals that 

2 / \/2\ 2 

T(p 2 * * v 2 ) = 0.625 < - < ( -j= \ T(p 2 * M2 * A*2, v 2 * v 2 * v 2 ). 

Proof of Proposition^ We start with the final assertion, which is easy to show. 
The 3-fold convolutions of the measures p n and v n , respectively, are supported 
on odd and even numbers, respectively. Hence they have disjoint supports with 
distance 1 and so the quadratic transportation costs are bounded from below by 1. 

For the proof of fix n 6 N, define a n — p n * p n and r„ = v n * z/„, and note 
that a n and r„ are supported by the even numbers. For k = —(2n— 1), . . . , (2n— 1) 
we denote by p n ^ the probability of the point 2k under a n , i.e. 

4n - 2 \ 1 



Pn,k = 



k + 2n-l) 2 4 "- 



-2 ' 



We define p n ,k — for |fc| > 2n. We have r„ = a n * p, where p is the distribution 
giving probability j, i, \ to —2, 0, 2, respectively. We deduce that for < k < 
2n - 2, 

, 1 1 1 

T n (2k + 2) = -p„.k + jPn,fe+2 + -jPn,k+l 

(3) = j(Pn,k ~ Pn,k+l) + j(Pn,fc+2 ~ Pn,k+l) + O n (2k + 2) 

1 ZVfc+1 \ 1 



1 Pn.k+l\ . 1 fPn,k+2 -,\ . , , . x 

~.Pn,k 1 1 + TPn,t+l ; 1 +0- n (2k + 2). 

4 V Pn,k > 4 Vp„,fe+1 / 
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Notice that p n ,k > p n ,k+i for < k < 2n — 1. The term in the first parentheses is 
therefore non-negative. It can easily be calculated and estimated via 

Q < 1 _ Pn,k+i =1 _ (fc+2n) = 1 _ 2n-k-\ = 2fc + 1 < 2fc + 1 

Pn.k L+2n-l) k + 2 " 2 " + * ~ 2 " ' 

for < k < 2n- 1. 

Following [5] we know that the quadratic Wasserstein distance T can be given 
by a cyclically monotone transport plan tt — ir n . We define the transport plan it 
via an intuitive transport map T. It is sufficient to define T for < k < 2n — 1, 
since it acts symmetrically on the negative side. T moves mass \p n ,k 2n+k f rom 
point 2k to 2fc + 2 for fe > 1. At fc = the transport T moves g^-Pn,o to every side, 
which is possible, since there is enough mass concentrated at 0. 

By equation ([3]) we see that the transport T moves a n to r n , since, for 1 < 
k < 2n — 2, the first terms corresponds to the mass, which arrives from the left 
and is added to cr n , and the second term to the mass, which is transported away: 
summing up one obtains r n . For k = 2n — 1, mass only arrives from the left. At 
k = mass is only transported away. By the symmetry of the problem around 
and by the quadratic nature of the cost function (the distance of the transport is 2, 
hence cost 2 2 ), we finally have 

, "^ 2 2 2k + l 2 ^ 2k+l 
T{a n ,r n ) < 2 2^ -jP^k^—^ < 2^ Pn ' k ^^- 

k=0 k=0 

By the Central Limit Theorem and uniform integrability of the function x > x + := 
max(0, x) with respect to the binomial approximations, we obtain 

1 2n_1 f°° X 2 

lim —= V {2k)p nM = I -^e~ x2/2 dx. 
n^oo 2^i ^ J V2V 

Hence 

2 

limsupV^^"(o-„,r„) < -= w 0.79788. 



2tt 

In order to obtain equality we start from the local monotonicity of the respective 
transport maps on non-positive and non-negative numbers. It easily follows that 
the given transport plan is cyclically monotone and hence optimal (see [51 Ch. 2]). 
The subsequent equality allows also to consider estimates from below. Rewriting 
© yields 

r n (2k + 2) = - A p n . k+1 (_^L_ _ i] + -p n ^ k+2 (l - P?hh±l\ + an {2k + 2) 

4 \p n ,k+l / 4 V PnM+2> 

for < k < 2n — 3, and 

r n (2k + 2) = \ Pn . k +i f-^- - l) + <J n (2k + 2) 
for k = 2n — 2. Furthermore, 

Pn,k _ 1= \k+2n-i) _ 1 = k + 2n 2k + 1 > 2k + 1 



n fin— 2\ 

5 ". fc +! \k+2n) 



f«-2\ 2n - k - 1 2n - jfe - 1 ~ 2n 
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for < k < 2n — 2. This yields by a reasoning similar to the above that 

2n-2 

Pn,k+1 

fc=0 

hence 

lim inf y/ri T(a n ,r n ) > 

n — >oo 

□ 

Remark 2. Let p > 2 be an integer. By slight modifications of the proof of Propo- 
sition^ we can construct sequences of measures (/i n )n£N an d { v n)n&h such that 
the quadratic Wasser stein distances of k-fold convolutions are bounded from below 
by 1 for all k which are not multiples of p, while 

lim T($\ V W)=0. 

71 — >OG 

Remark 3. Assume the notations of [5]- In the previous considerations we can 
replace the quadratic cost function by any other lower semi-continuous cost function 
c : M 2 — > [0, +oo] ; which is bounded on parallels to the diagonal and vanishes on 
the diagonal. For example, if we choose c(x,y) — \x — y\ r for < r < oo, then we 
obtain the same asymptotics as in Proposition^ (with a different constant). 

Remark 4. We have used in the above proof that r„ is obtained from a n by con- 
volving with the measure p. In fact, this theme goes back (at least) as far as L. 
Bachelier's famous thesis from 1900 on option pricing [2j p. 45]. Strictly speaking, 
L. Bachelier deals with the measure assigning mass \ t° ~ lj 1 an d considers con- 
secutive convolutions, instead of the above p. Hence convolutions with p correspond 
to Bachelier's result after two time steps. Bachelier makes the crucial observation 
that this convolution leads to a radiation of probabilities: Each stock price x radi- 
ates during a time unit to its neighboring price a quantity of probability proportional 
to the difference of their probabilities. This was essentially the argument which al- 
lowed us to prove (fT]) . Let us mention that Bachelier uses this argument to derive 
the fundamental relation between Brownian motion ( which he was the first to define 
and analyse in his thesis) and the heat equation (compare e.g. [3] for more on this 
topic). 

Remark 5. Having established the above counterexample, it becomes clear how 
to modify Problem 7.20 from [5] to give it a chance to hold true. This possible 
modification was also pointed out to us by C. Villani. 

Problem 1. Let p be a probability measure on R d with finite second moment 
and vanishing barycenter, and 7 the Gaussian measure with same first and second 
moments. Does (T(p^ n \j)) n >i decrease monotonically to zero? 

When entropy is considered instead of the quadratic Wasserstein distance the 
corresponding question on monotonicity was answered affirmatively in the recent 
paper pQ. 

One may also formulate a variant of Problem 7.20 as given in (fT]) by replacing 
the measure v through a log-concave probability distribution. This would again 
generalize problem 1. 



V2n 
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